Wen Xiao

University of British Columbia

Smart-Insertion-V: Photorealistic Video Insertion via a Closed-Loop Feedback Dual-Stream Framework

May 22, 2026
Via arXiv

CopT: Contrastive On-Policy Thinking with Continuous Spaces for General and Agentic Reasoning

May 19, 2026
Via arXiv

BPC-Net: Annotation-Free Skin Lesion Segmentation via Boundary Probability Calibration

Apr 07, 2026
Via arXiv

MMGR: Multi-Modal Generative Reasoning

Dec 17, 2025
Via arXiv

SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs

Oct 06, 2025
Via arXiv

Scaling Up Audio-Synchronized Visual Animation: An Efficient Training Paradigm

Aug 05, 2025
Via arXiv

R-KV: Redundancy-aware KV Cache Compression for Training-Free Reasoning Models Acceleration

May 30, 2025
Via arXiv

VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection

May 26, 2025
Via arXiv

HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading

Feb 18, 2025
Via arXiv

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Dec 30, 2024
Via arXiv